Cpu Tlb Stream Buffers L 1 Cache Stream Allocator

نویسندگان

  • Sridhar Gopal
  • Rahul Kapoor
  • S S Sastry
چکیده

We explore a variation of Jouppi's stream buuers scheme that reduces the memory stall time by prefetching lines on cache misses. In Jouppi's scheme since the stream buuers are oo-chip, a penalty is incurred even when the required line is available in the stream buuers. We propose moving the stream buuers on-chip to reduce this penalty. Also a line that hits in the stream buuer is not moved into the cache immediately unless the data is identiied to exhibit temporal locality. We use a stream buuer allocator that integrates the ltering and stride detection mechanisms used by Palacharla. We present trace driven simulation results for some of the SPEC92 benchmarks that demonstrate the eeectiveness of our scheme. The results show that there are improvements over a normal cache in all cases and for some benchmarks our scheme reduces the number of oo-chip memory fetches.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Predicting and Prefetching Tlb Entries from Irregular Access Streams

This thesis describes the design and evaluation the Irregular Translation Buffer (ITB), which is a prefetcher for translation lookaside buffer (TLB) entries that is based on the Irregular Stream Buffer (ISB) introduced by Jain and Lin [1]. The main idea to use the ISB's improvements in prefetching cache lines from irregular access streams and use them to prefetch entries from irregular access s...

متن کامل

Stride-directed Prefetching for Secondary Caches

Thi s paper studies hardware pre fe tch ingfor second-level ( L 2 ) caches. Previous work o n prefetching has been extensive but largely directed a t p r imary caches. In some cases only L 2 prefetching i s possible or i s more appropriate. B y s tudying L2 prefetching characterist ics we show that existing stride-directed methods [l, 81 for L1 caches do no t work as well in L2 caches. W e prop...

متن کامل

Further Cache and TLB Investigation of the RAMpage Memory Hierarchy

The RAMpage memory hierarchy is an alternative to the traditional division between cache and main memory: main memory is moved up a level and DRAM is used as a paging device. Earlier RAMpage work has shown that the RAMpage model scales up better with the growing CPU-DRAM speed gap, especially when context switches are taken on misses. This paper investigates the effect of more aggressive first-...

متن کامل

Towards Smart Cache Management for Ontology Based, History-Aware Stream Reasoning

Stream reasoning is an exciting multidisciplinary research area that combines stream processing and semantic reasoning. Its goal is to not only process a dynamic data stream, but also to extract explicit and implicit information on-the-fly. One of its challenges is managing history awareness: how much and which historical data should be held and for how long as we continuously query and reason ...

متن کامل

The Impact of Timeliness for Hardware-based Prefetching from Main Memory

Among the techniques to hide or tolerate memory latency, data prefetching has been shown to be quite effective. However, this efficiency is often limited to prefetching into the first-level cache. With more aggressive architectural parameters in current and future processors, prefetching from main memory to the second-level (L2) cache becomes increasingly more important. In this paper, we exami...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1996